A novel application of big data to measure trends in tourism: France, Spain and Denmark, January 2016 - March 2022

TOURism Flows in European destinations during and after the Covid-19 pandemic (TOURCO)

The purpose of TOURCO is to use computer-based algorithms to scrap large unique data on tourism flows from Tripadvisor. 
The project cover periods before, during, and after the Covid-19 pandemic of three European countries (France, Spain, and Denmark). 
These data make it possible to capture pre-trends and also changes that are a result of the pandemic.

Recommended citation: 
* Borowiecki, Karol J., Pedersen, Maja U. and Mitchell, Sara B. (2023). Using big data to measure cultural tourism in Europe with unprecedented precision. Discussion Papers on Economics, Working paper No 5/2023, University of Southern Denmark.
* Borowiecki, Karol J. and Pedersen, Maja U. (2024). A novel application of big data to measure trends in tourism: France, Spain and Denmark, January 2016 – March 2022. Mobile Lives Forum (MLF) report.


The data is presented in Borowiecki et. al. (2023) and (2024), together with a presentation of validity tests, to show the validity of using the collected data as a measure of tourism. Furthermore, the data is also presented and described in the Data Manual and Description - Tourism flows in European destinations during and after the Covid-19 pandemic (2023), by Sara Mitchell and Karol J. Borowiecki. 

Below are descriptions of each of the data modules provided along with the data manual:

ATTRACTIONS data module (attr.csv): Consists of a list of all tourist attractions listed on the respective country's *Things to do* page on Tripadvisor at the time of data scraping.

REVIEWS data module (reviews.csv): Consists of all reviews in all included languages for each respective attraction. 

REVIEWS data module (reviewsXX.csv): Consists of reviews in different "XX" languages together with a detailed LIWC text analysis of the review text. 

USERS data module (users.csv): Contains basic information on the users who wrote at least one review for at least one attraction in our sample of countries.

TRAVEL HISTORY data module (travelHistory.csv): Contains data on reviews written by users included in the user profile module.

For further details, refer to the Data Manual and Description. 

The project is funded and scientifically supervised by the Mobile Lives Forum, as part of its research program on the mobility transition. The Mobile Lives Forum is a research institute created by SNCF.

Contact: Karol Jan Borowiecki (kjb@sam.sdu.dk).